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Abstract. A novel approach is presented for the solution of instantaneous chemical equilibrium problems. The 
chemical equilibrium can be considered, due to its intrinsically local character, as a mapping of the three- 
dimensional parameter space spanned by the temperature, hydrogen density and electron density into many 
one-dimensional spaces representing the number density of each species. We take advantage of the ability of arti- 
ficial neural networks to approximate non-linear functions and construct neural networks for the fast and efficient 
solution of the chemical equilibrium problem in typical stellar atmosphere physical conditions. The neural network 
approach has the advantage of providing an analytic function, which can be rapidly evaluated. The networks are 
trained with a learning set (that covers the entire parameter space) until a relative error below 1% is reached. It 
has been verified that the networks are not overtrained by using an additional verification set. The networks are 
then applied to a snapshot of realistic three-dimensional convection simulations of the solar atmosphere showing 
good generalization properties. 
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^ 1. Introduction 



Molecules are usually found in highly dynamic systems 
(e.g., the solar atmosphere, winds of AGB stars, ...) and 
their formation is influenced by the time variation of 
the physical conditions in the medium. In many situa- 
tions the dynamical timescales are much slower than the 
timescales of molecular formation, and the approximation 
of Instantaneous Chemical Eq uilibrium (ICE ) can be used 
with extremely good resu lts llR.usselllll93i iTsuiil Il964l 
Il973l iMcCabe et al.lll979l etc.). Under the ICE approxi- 
mation, molecules and ions are assumed to form instan- 
taneously and their abundances depend only upon the 
local temperature and density. Another consequence of 
this assumption is that the specific reaction mechanisms 
that create and destroy a given molecule are irrelevant 
and only the molecule and its constituents are impor- 
tant. Concerning the solar atmosphere, it has been re- 
cently shown with the comparison between ICE calcula- 
tions and detailed chemical evolution calculations that the 
formation of the strong CO lines around 4.7 /jm can be 
well described using the ICE approximation for heights be- 
low ~700 km ijAsensio Ramos et aljEoQsT) . On the other 



* The National Center for Atmospheric Research (NCAR) is 
sponsored by the National Science Foundation 



hand, chemical evolution effects in regions of the atmo- 
sphere above this height are of importance in setting the 
CO abundance. 

In order to calculate the atomic and molecular number 
densities using the ICE approximation one needs to solve 
a non- linear system of equations (see below). Although 
efficient numerical methods exist for solving this kind of 
systems, the computing time when the ICE problem has 
to be solved in a large amount of points (e.g., dense grids, 
multi-dimensional geometries, iterative inversions, etc) be- 
comes prohibitive. It is then very important to develop a 
numerical method for the rapid solution of chemical equi- 
librium problems. 

Artificial Neural Networks (ANNs) have proven to be 
a p owerful appro ach to a broad variety of problems (see, 
In the solar community, they have been 
recently applied to the problem of inferring the magnetic 
field from observat ions of the polarization profiles of se- 
lected atom ic lines l|Carroll fc Staudei200lHSocas-Navarrol 
20031 12005|) . In this paper, we make use of the ability of 
ANNs with one hidden layer to approxima t e any non- 
linear continuous function (e.g.. Ijoneslll99ol iBlum fc Lil 
.1991) to solve the ICE problem. 

The structure of the paper is as follows. Section |21 de- 
scribes the ICE approximation and its standard solution. 
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Section |3| describes our approach to the ICE problem us- 
ing ANNs, discussing how it can be trained to associate 
a combination of physical parameters with the number 
density of each species included in the problem. Section 
0] details how the trained network can be applied to solve 
the ICE problem in realistic convection simulations of the 
solar atmosphere. In section we show the dependence 
of the outputs of the ANN on the physical parameters, 
which can be easily done because of the intrinsic analytic 
character of the mapping generated by the ANN. Finally, 
the most relevant conclusions are summarized in Section 

E 

2. Instantaneous chemical equilibrium 

2.1. Basic equations 

Consider three elements A, B and C that stick together 
to form a molecule A a BbC c , in which a, b and c are the 
stoichiometric coefficients indicating the number of times 
an element is present in the molecule. Because the explicit 
reactions which form the molecule are irrelevant under the 
ICE approximation, one needs only to consider the follow- 
ing dissociative process for the formation of the molecule 
A a BbC c : 

A a B b C c ^>aA + bB + cC. (1) 

This reaction is characterized by its dissociative reaction 
constant or equilibrium constant. It is given by the ratio 
between the product of the partial pressures of the indi- 
vidual elements and the partial pressure of the molecule: 



We now define the fictitious pressure P(i) as the pressure 
exerted by element i if all the gas were in the neutral 
form of the atomic species i. It is customary to specify 
the partial pressures of all the atomic species in terms of 
the fictitious pressure of hydrogen P{H). The coefficient 
relating one and the other is the abundance of each ele- 
ment with respect to hydrogen, A(i), so P(i) = A(i)P(H). 
The abundance of each atomic species depends on the 
metallicity of the star we are considering. For the solar 
ca se, we have used the standard solar abundances given 
bv lGrevessel l|l984|) . 

Under ICE conditions, the number of atoms and 
molecules are obtained by solving the conservation of mass 
and the chemical equilibrium conditions given by Eq. © . 
The conservation of mass establishes that the sum of 
the partial pressures of all the species containing a given 
atomic element (taking into account the stoichiometry) 
equals the fictitious pressure of the given element: 



P(i) = P + P l+ + Pi- 



£4^ 



(7) 



where Pi, Pi+ and P^- are the partial pressure of the neu- 
tral element i, and the ionized elements i + and i~ , respec- 
tively, while Pk is the partial pressure of molecule k which 
has species i in its composition. uj k is the stoichiometric 
coefficient which indicates the number of times element i 
appears in molecule k. The sum is extended over all the 
molecules in which the element i takes part. The previous 
equation can be rewritten with the aid of Eqs. J5J and (JSJ): 



K P {T) 



pa pb r>c 
r A r B r C 



Pabc 

where K p (T) is a function of the temperature and Pi the 
partial pressure of the species i. For example, the equi- 
librium constants for the dissociation of CO and H2 arc 
given by 



KC°(T) 



PcPq 
Pco ' 



K^(T) = pL. (3 ) 



The partial pressure is usually related to the number den- 
sity rii of a given species by the ideal gas equation: 



-Pi — Y^i. k'J. 



(4) 



where k is the Boltzmann constant and T the local tem- 
perature. This partial pressure is the pressure of the gas if 
no other species were present. The sum of all the partial 
pressures is the total pressure in the medium. 

Following the same line of reasoning, one can consider 
the ionization equilibria for the atomic species: 



A A + + e 
A + e~ A~, 

with their corresponding equilibrium constants: 



K A+ {T) 



P A +Pe- 

Pa 



K a -{T) 



Pa 



P A Pe 



(5) 



(G) 



(2) P(i) = p i+ K i+ -p- + K i -P i P e -+J2 



^1 



^3 "' 

K { p k) (T) 



(8) 

where i, j, . . . , I are the atomic species composing 
molecule k, u l k + ux^ + . . . + ui k = is the number of 

atoms of molecule k and K p k ^ (T) is its equilibrium con- 
stant. Charged molecules can be easily included by taking 
into account their dissociation and the ionization equilib- 
rium, simultaneously. Consider the dissociation equilib- 
rium of an ionized diatomic molecule. One only needs to 
take into account that when a charged molecule dissoci- 
ates, the atomic element which remains ionized is the one 
with the lowest ionization potential: 



AB+ & A~* 



B, 



(9) 



with Do (A) < Dq(B). Therefore, using Eq. 10) and Eq. 
© we can calculate the partial pressure of the ionized 
molecule: 



K P (T) 



Pa+Pb 



K A+ (T)P A P B 



AB+ 



P, 



AB+ 



(10) 



We can write an equation like Eq. © for each of the 
N s atomic species included in the calculation. In our case 
we have selected the 21 species which are shown in Table 
^ Therefore, once the molecular and ionization equilib- 
rium constants are known, we have a set of non-linear 
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algebraic equations which depend on the local tempera- 
ture and density. The ICE approximation is intrinsically 
local since it depends only on the local values of the tem- 
perature and density. Therefore, to calculate the molecu- 
lar number densities in a model atmosphere, we have to 
solve the algebraic non-linear system of equations at each 
point in the atmosphere. This system is sol ved using a 
stand ard Newton- Raphson iterative method I Press etall 
Il98fil) . Once the partial pressures for all the atomic species 
are known, we can calculate the ensuing atomic number 
densities by using Eq. @. The molecular number densi- 
ties are calculated by solving for their partial pressures 
in Eq. J2J, using the equilibrium constant appropriate for 
each molecule. 



2.2. Equilibrium constants 

As we discussed above, the problem of obtaining the 
atomic and molecular number densities is completely de- 
fined once the temperature and the total density (or pres- 
sure) is known. However, we also need to know the value 
of the equilibrium constant for each temperature. It is 
known from considerations of statistical mechanics that 
the value of the equilibrium constant can be obtained 
with the aid of the partition function of the molecule 
and of the individual atomic components, their respec- 
tive masses and the dissociation energy Do (see, e.g., 
iTeiero Ordonez fe Cernicharolll99lt for detailed informa- 
tion on how the equilibriu m constants can be calculated). 
The e xpression is given by l|Teiero Ordonez fc Cernicharol 
Il99ll and references therein): 

K V (T) = (fcT)^ -1 ' i t "'.\" ! l!'"< 



,J AVBVC 
4>ABC 



h 2 

,-D a /kT 



rriABC 



(11) 



where m, is the mass of atomic or molecular species i, 
N A = a + b + c is the number of atoms present in the 
molecule, <j>i is the partition function of species i. h and 
fc stand for the Planck and Boltzmann constants, respec- 
tively. The partition functions can be obtained by per- 
forming the summation 



(12) 



where gj is the degeneracy of level j and Ej its en- 
ergy. Ideally, the sum has to be extended over all 
the energy levels of the species. However, due to the 
difficulty of accounting for all the energy levels, the 
most complete available set of energy levels must be 
used. For molecules, the partition function has to in- 
clude the contribution from the electronic, vibrational 
and rotational levels. It is not practical to compute 
this summation for every ICE calculations. As a 
workaround, some authors h ave tabulated polynomial 
fits to the partit io n func ti ons jRussel]|ll934HTsuiilll97ll: 
ISauval fc Tatuml 1 19841: ITeiero Ordonez fc Cernicharol 



Il99l|). In this work, we make use o f the results obtained 
hv ITeiero Ordonez fc Cernicharol lll99lh . It represents 
one of the most complete compilations of molecular 
equilibrium constants for more than 240 diatomic and 
polyatomic molecules. These equilibrium constants were 
obtained by calculating the partition function with the 
most up-to-date molecular constants at the time of 
the work and their dependence with the temperature 
are given, as in previous works, as polynomial fits. 
One of the advantages of selecting this database is its 
self-consistency. 

3. Neural network 

3.1. Description of the network 

The ICE problem consists on obtaining the partial pres- 
sures of the atomic species (or the atomic species number 
densities) which are consistent with Eq. © once the lo- 
cal temperature, local hydrogen density and local electron 
density are given. One could also take a different point 
of view and see the ICE problem as a mapping between 
the three-dimensional space given by (T, ur , n e ) onto sev- 
eral one-dimensional spaces, one for each atomic species 
included in the problem. The functional form of the func- 
tions / : R — * R is not known. 

The Artificial Neural Network (ANN) with one hidden 
layer is a universal a pproximant to any non-linear con- 
tinuous function Ce.g.. lJoneslfl990l:lBlum fc Lilll99l[) . The 
schematic structure of such a network is shown in Fig. 
n We have constructed an ANN with an input layer of 
three neurons where the temperature, hydrogen density 
and electron density are introduced. The neurons of the 
input layer have a linear activation function. The hidden 
layer consists on Nh neurons with a non-linear activation 
function a(x). Each hidden neuron is connected to all the 
neurons of the input layer by a certain weight (indicated 
by arrows in the figure) . The value obtained at each hidden 
neuron is a linear combination of the values at the neurons 
of the input layer multiplied by the weights. These values 
are then applied the non- linear function a(x), multiplied 
by another set of weights and summed to give the final 
output of the neural network. The output of the network 
can be written as: 

N h 

N(T, n(H), n(e), m) 

(13) 

where the activation function is usually given by the sig- 
moid or the hyperbolic-tangent function. In this case, we 
have selected a{x) — tanh(x). In the previous expression, 
m formally represents the whole set of 5 x Nh weights 
which define the neural network. 

The use of a neural network approach for the solu- 
tion of the ICE problem is very appealing for several rea- 
sons. Firstly, we obtain an analytic function which is in- 
finitely differentiable and which could be easily used in any 
subsequent calculation. This way, we can investigate with 
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Fig. 1. Schematic structure of the neural network model 
used for approximating the ICE results. The input layer 
and the output layer arc linear, while the hidden layer 
is assumed to be non-linear, with an hyperbolic-tangent 
activation function. The input layer consists of three neu- 
rons for the temperature, hydrogen number density and 
electronic number density. The output neuron gives the 
partial pressure of element i for the physical conditions in 
the input layer. 

great detail the dependence of the atomic and/or molec- 
ular abundances with temperature, hydrogen density and 
electron density. Secondly, the powerful approximation ca- 
pabilities of ANNs make them very suitable for multi- 
dimensional interpolation problems. Even more striking is 
the ability of ANNs to extrapolate data outside the range 
of parameters used in the learning process. A direct con- 
sequence of these properties is that the number of points 
that need to be used for the learning process can be very 
small compared to the case of standard interpolation tech- 
niques. Furthermore, the number of parameters needed for 
performing the interpolation is also small. Essentially, one 
transforms the problem from storing in memory the whole 
set of N points which relates T, n{H) and n{e) with the 
partial pressures Pi for every species to storing only the 
5 x Nh Tn weights present in Eq. I|13|) for each species. 
This reduction is proportional to the ratio N/Nh, which 
can be very large for large, as we will see below. Finally, it 
is also important to note that the neural network can be 
straightforwardly implemented in parallel architectures. 

3.2. Learning process 

In order to obtain a neural network which is able to ap- 
proximate the abundance of each species included in the 
chemical equilibrium calculation, we randomly select Ni 
sets of temperature, hydrogen density and electronic den- 
sity. For each one of these combinations, we solve the 
ICE equations and calculate the abundances of all the 
atomic species. We have verified that taking Ni — 1000 



gives a good coverage of the parameter space, consider- 
ing that they are randomly selected. The temperature is 
varied in the range from 3500 K and 14000 K, which is 
representative of the physical conditions in typical stel- 
lar atmospheres. Concerning the hydrogen and electronic 
abundances, we decided to use a logarithmic scaling. The 
hydrogen density is varied from 10 12 cm -3 to 10 18 cm~ 3 
while the electronic density is varied from 10 s cm' 3 to 
10 17 cm" 3 . Again, these values are a reasonable represen- 
tation of realistic conditions. 

Once the learning set is selected, the training of the 
network for each species i reduces to a minimization of 
the following error function: 

E * = E [ P l ~ N{Ti,ni{~R), m (e), m)} 2 , (14) 
l=i 

so that the optimum values of the parameters m are ob- 
tained. P\ is the partial pressure (or number density) of 
species i obtained for the combination of physical condi- 
tions I. Since almost every minimization algorithm makes 
use of the derivatives of the error function with respect to 
the parameters, it is of interest to notice that they can be 
obtained analytically: 

■P- = -2 V [P, - i\r(T,,n,(H), n,(e), m)) (15) 
k i=i k 

where the partial derivatives of the neural network with 
respect to the parameters are obtained in a straightfor- 
ward manner by using Eq. (|13|l . In order to obtain the set 
of parameters m which minimize the err or function, we 
have used a quasi-Newton BFGS met hod llFletcherlll987t) 
implemented in the Merlin package ( Papaeeorgiou et al. 
1998) which presents a very good convergence rate. 

It is interesting to note that the usual approach of 
training the neural network with a learning set obtained 
from the original problem can be circumvented here. We 
do not need to rely on the precision given by the Newton- 
Raphson solution and the neural network can be made 
as precise as desired. This is a consequence of the fact 
that the equations describing the ICE approximations are 
known. One can see that the equations of ICE can be for- 
mally written as f(T, n(H), n(e), {Pi}) — 0, where {Pi} is 
the set of N s partial pressures (or atomic number densi- 
ties) which satisfy the equations. Therefore, we can build 
as many neural networks as species included in the ICE 
problem and train the network by minimizing the follow- 
ing error function: 

Ni 

E = Y,[f(Ti,n l (R),n l (e),{N l (T l ,n l (R),n l (e),m)})] 2 . 
i=i 

(16) 

The solution to the problem is transformed into finding the 
set of parameters m for all the networks which minimizes 
the previous error function. The summation is extended 
over the Ni elements of the learning set. Several issues are 
noteworthy in this case. On the one hand, the derivatives 
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Fig. 2. Relative error obtained after the training of the neural network for hydrogen, oxygen, carbon and iron. The 
distribution of relative errors are shown for the learning set and for a verification set, which aids at detecting over- 
training. We indicate the value of the standard deviation a of the distribution and the number of points below la, 2a 
and 3a. We also indicate these values in the plots as horizontal lines. 



of the error function with respect to the parameters m 
are not as straightforward as those found in Eq. I|15l) since 
the function f(T,n(H),n(e), {Pi}) = is non-linear. On 
the other hand, the minimization process has to be carried 
out simultaneously for the N s neural networks, so that the 
problem of finding the minimum might be more difficult 
to solve. The dimension of the hypersurfacc in which we 
have to find the minimum is 5 x Nh x N s , instead of finding 
N s minima in hypersurfaces of dimension 5 x Nh- 

A critical parameter in the training ability of a neural 
network is the number of hidden neurons Nh- It is impor- 
tant to build networks with sufficient number of hidden 
networks to accurately approximate the non-linear func- 
tion we are interested in. However, since the number of 
learning points is not infinite, the number of hidden neu- 
rons one can use is practically limited by the overtraining 
phenomenon (in principle, the number of training points 
should be much larger than the number of hidden neu- 
rons). If Nh is very large, the network is capable of cor- 
rectly approximating all the points in the learning set but 
its interpolation capacity is lost, producing large oscilla- 
tions between the points of the learning set. This is a di- 
rect consequence of the increase in the degrees of freedom 
of the network when the number of hidden neurons is in- 



creased. One way to avoid this overtraining is to use two 
different data sets: one for learning purposes and the other 
for testing. When the training process takes places, the er- 
ror given by expression (|14|l is evaluated for the training 
set and for the test set. When the network is training cor- 
rectly, both errors are reduced. When the network starts to 
be overtrained, the error of the test set starts to increase. 
We have followed this scheme in order to stop the train- 
ing process before overtraining. In our case, we have used 
two different values for the number of hidden neurons. We 
have verified that Nh = 20 gives sufficiently accurate re- 
sults for 13 of the 21 atomic species included in the ICE 
calculations, while we were forced to increase this num- 
ber to Nh = 30 for the rest of species. Table ^ lists the 
number of hidden neurons for each species. We did not 
investigate the dependence of the minimum error on the 
number of hidden neurons so that the increase to Nh = 30 
was somewhat arbitrary. 

The training of the neural networks was stopped when 
the standard deviation of the relative error between the 
output of the neural network and the values obtained from 
the solution of the ICE equations for both the learning and 
the test sets was 1% or below. Obviously, the relative er- 
ror was always larger for the test set than for the learning 
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2 4 6 8 10 13 14 1 2 3 4 5 

Relative error [%] Relative error [%] 

Fig. 3. Histog r ams o f relative errors for the 2.55xl0 5 points of the three-dimensional convection simulation of 
lAsnlund et al.l ^2000^1 . We indicate the position of la, 2a and 3a of the distribution. Note that they are similar 
to those obtained for the verification set during the training of the networks. 



Table 1. Number of hidden neurons used for each ANN. 



Species 


Number of hidden neurons 


He, Na, Mg, Al, P, K, Ca 


20 


Ti, Cr, Mn, Fe, Ni, Cu 




H, C, N, O, F, Si, S, CI 


30 



set, but the training always reached the limit of 1% and 
was stopped before any indication of overtraining was ob- 
served. In Fig. [5] we present the relative error obtained for 
the 1000 training points for the abundances of H, C, O and 
Fe. The behavior of the relative error for the rest of species 
follows the same behavior, except for He, which presents a 
much smaller dispersion. Since He cannot form molecules, 
it is always in its atomic form and the non-linear mapping 
that the neural network has to learn is strongly simplified. 

We have indicated in each plot the value of the stan- 
dard deviation and the percentage of points which are con- 
tained within lcr, 2a and 3a. This is an indication of the 
precision obtained with the neural network. It is common 
in all the networks that more than ~ 95% of the points are 
inside 3a, which translates into a relative error of ~ 3% 
(since a is always of the order or below 1%). However, 
we note that it is common to all the networks that more 
than 75% of the points are below la. Along with the plots 



of the results obtained for the learning set, we have in- 
cluded the results for the verification set. It is interesting 
to note that, although the standard deviation of the dis- 
tribution of relative errors is larger than that obtained for 
the learning set, typically more than ~ 80% of the points 
are below this limit. In the case of the network for the 
carbon abundance, we have a relatively high value for a, 
but more than 90% of the points are below this limit. 

4. Comparison of results 

Once the neural networks were trained to the desired pre- 
cision, as shown in Fig. [21 we have applied them to realistic 
situations in order to compare their performance with the 
complete ICE calculations. To this end, we make use of a 
three-dimensional snapsh ot of the conve c tion s imulation 
of the solar atmosphere o solun d et al . (2000). We con- 
sider this simulation to be an adequate representation of 
the physical conditions in the solar atmosphere since sev- 
eral investigations are reporting good agreements between 
the synthetic line profiles and th e observed ones (e.g., 
Shchukina fc Truiillo Buenol l200ll lAsplund et all I2004L 
2003). The snapshot box size is 50x50x102, so that the 
number of points is 2.55xl0 5 . We have solved the ICE 
problem in each point and confronted the results with 
those obtained from the neural networks. 
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Fig. 4. Number density of carbon and oxygen obtained with the simplified model and with the neural network for two 
different combinations of hydrogen and electron densities and for different values of the temperature. Note that this 
kind of investigation is greatly simplified due to the analytical character of the neural network. 



The normalized error histograms are shown in Fig. 
for H, C, O and Fe, with vertical dashed lines indicating 
the position of la, 2a and 3a of the distribution. Note 
that the histograms are decreasing functions of the rela- 
tive error, so that the number of points with large rela- 
tive error is much smaller than the number of points with 
small relative errors. It is verified that the value of the 
standard deviation of the distribution of relative errors 
is quite similar to those obtained for the verification set. 
This reinforces our confidence that the learning set and 
the verification set provide sufficient coverage of the three- 
dimensional parameter space (T,n(H),n(e)). The shapes 
of the histograms turn out to be fairly linear in this log- 
linear scale, so that we can consider, as a first approxima- 
tion, that the number of points N p with a certain relative 
error r is given by N p cx r~ a . For example, we find that 
a ~ 5 for hydrogen, a ~ 3.7 for carbon and a ~ 7 for oxy- 
gen. There is an exception for iron, for which the distri- 
bution of points with relative errors below ~ 2.3% seems 
to be constant, while the number of points with errors 
larger than ~ 2.3% is three order of magnitude smaller. 
The behavior of the histograms for the rest of species is 
very similar to those shown in Fig.^lfor the selected ones, 
maintaining the properties obtained for the verification 
sets. 



5. T, n(H) and N(e) dependence 

Since the neural network approach leads to analytic for- 
mulae for the solution of ICE problems, we now take ad- 
vantage of them and compare the results for abundant 
species with simplified models. Our aim is to investigate 
the variation with temperature, hydrogen density and 
electronic density of the number density of carbon and 
oxygen. We have selected these two species because they 
are the constituents of the highly abundant CO diatomic 
molecule. This way, we can safely obtain an approximation 
to the carbon and oxygen abundances by taking only into 
account the formation of CO in the ICE equations and ne- 



glecting the rest of molecular species. Note that this sim- 
plification is only valid for species like C and O and not for 
species which form less abundant molecules and/or form 
several high abundance molecules. Specializing Eq. (jSJ) to 
oxygen and carbon, we obtain: 

PrPn Pr 

nc) = p c + 1 ^ + K c+1 ±- + K c -P C P e - 

P(0)=Po + -^% + K o+ ^ + K o -P o P e -. (17) 
^co(J J P e - 

This system of equations can be solved analytically for 
Pc and Po and transformed into number densities by us- 
ing the ideal gas equation Eq. Their dependence with 
temperature for two different combinations of hydrogen 
and electronic densities are shown in Fig. ^ The output 
of the neural network is also shown in the plots with solid 
lines. Note the accurate results obtained with this simpli- 
fied model in this high abundance case. This behavior can 
be better understood if we plot separately the contribu- 
tion to the total pressure of each element of each term in 
Eq. lJT7}l. The results are shown in Fig. for the two sets 
of physical conditions. The fictitious pressure of carbon 
and oxygen (essentially the abundance of carbon and/or 
oxygen times the hydrogen density) is shown by solid line, 
the partial pressure of the neutral element by dashed line, 
the partial pressure of the carbon monoxide molecule by 
dash-dotted line and the partial pressure of the ionized 
element by long dashes. All the quantities shown in the 
plots have been obtained using the neural network ap- 
proach. Although the neural network takes into account 
the formation of other molecules which contain carbon 
and oxygen (i.e., it gives the solution to the complete ICE 
problem), we see that accounting only for CO appears to 
be enough since the addition of all the contribution closely 
approximates the value of the fictitious pressure. 

Note that the CO molecule is efficiently formed when 
the temperature is below ^5000 K in both situations. The 
neutral species are mainly situated at intermediate tem- 
peratures, typical from photospheric regions, while the 
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Fig. 5. Contribution to the total pressure of carbon and oxygen in two different cases. The curves have been obtained 
using the neural networks. The addition of all the contribution closely matches the fictitious pressure of carbon and 
oxygen. This is the reason why the very simplified model for C and O works very well. 



ionized species tend to dominate for high temperatures, 
above 6000 - 7000 K. 



6. Conclusion 

We have successfully trained 21 neural networks which 
approximate the solution to the Instantaneous Chemical 
Equilibrium problem. We have generated a learning set 
which is representative of the physical conditions in stel- 
lar atmospheres and have verified that the coverage of 
the three-dimensional parameter space is sufficiently good 
with only 1000 points. We have trained the neural net- 
works so that the standard deviation of the relative er- 
ror were below or of the order of 1%. In order to avoid 
overtraining, we have employed an independent verifica- 
tion set. The standard deviation of the relative error in 
this verification set is larger than that for the learning 
set but also of the order of 1%. We apply the neural net- 
works obtained to the ICE problem in a three dimensional 
convection simulation, representative of the physical con- 
ditions in the solar atmosphere. The histograms of rela- 
tive errors built with all the points in the simulation show 
that the neural networks are capable of solving the ICE 
problem with relative errors similar to those obtained for 
the verification set. The advantage of the neural network 
approach is their intrinsic analytical character, the possi- 
bility of parallclization and the very fast evaluation. The 
development of a fast approach to ICE is of importance 
for, among others, recent multi-dimensional simulations 
of stellar atmospheres which include very dense grids and 
iterative inversions of observed spectra. 
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